Automatic detection of uncertainty in spontaneous German dialogue
Abstract
Uncertainty is ubiquitous in natural human communication. Human listeners continuously assess the speaker’s degree of uncertainty and use this information to shape dialogue. In contrast, currently available computer systems dealing with spoken language are usually not built to perform this task. The ability to detect uncertainty would likely lead to more natural human-computer dialogue. In order to detect uncertainty automatically, we extract linguistic, paralinguistic and dialogue-related features from the Kiel Corpus, a corpus of naturalistic task-oriented spoken German. We then use these features to train a random forest model. Our experimental results show that relatively high classification accuracy can be obtained while employing only 64 well-chosen features (73% accuracy, 69% F1). To the best of our knowledge, this is the first study of automatic uncertainty detection using German speech data as well as the first achieving good performance on everyday speech.
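The two reported figures, accuracy and F1, measure different things, and a small worked example may help when reading them. The sketch below is not taken from the paper: it assumes a binary task with "uncertain" as the positive class (the abstract does not say whether F1 is binary or averaged over classes), and the function name `accuracy_and_f1` and the toy labels are hypothetical.

```python
def accuracy_and_f1(y_true, y_pred, positive=1):
    """Return (accuracy, F1) for two equally long label lists.

    Assumption: `positive` marks the "uncertain" class; the paper's
    exact averaging scheme for F1 is not stated in the abstract.
    """
    pairs = list(zip(y_true, y_pred))
    # Confusion counts with respect to the positive class.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # Accuracy: share of all items labelled correctly.
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)

    # F1: harmonic mean of precision and recall (0.0 if undefined).
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return accuracy, f1


# Toy labels, purely illustrative (1 = uncertain, 0 = certain).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]
acc, f1 = accuracy_and_f1(y_true, y_pred)
print(f"accuracy={acc:.2f}, F1={f1:.2f}")
```

Accuracy counts every correct decision equally, while F1 ignores true negatives, so the two can diverge when the classes are imbalanced.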
Similar resources
Disfluent Lengthening in Spontaneous Speech
We investigate lengthening in spontaneous speech with the aim of using it as a time-management strategy in incremental spoken dialogue systems. Lengthening is a common feature of speech, occurring regularly near the edges of intonation phrases. It behaves similarly to disfluencies when it occurs in places remote from phrasal boundaries. Disfluencies have proven useful in incremental spoken ...
Using Phrase Accent Information for Dialogue Act Recognition in Spontaneous German Speech
This paper describes an approach in which phrase accent information is used for dialogue act recognition in German spontaneous speech. This application is an example of how automatically computed prosodic information can be used in automatic speech recognition. Usually the important intention conveyed by an utterance is found in the focused area, which is often accentuated. When all the words of...
Automatic detection of causal relations in German multilogs
This paper introduces a linguistically motivated, rule-based annotation system for causal discourse relations in transcripts of spoken multilogs in German. The overall aim is an automatic means of determining the degree of justification provided by a speaker in the delivery of an argument in a multiparty discussion. The system comprises two parts: a disambiguation module which differentiates ...
Recent Progress in Corpus-Based Spontaneous Speech Recognition
This paper overviews recent progress in the development of corpus-based spontaneous speech recognition technology. Although speech is in almost any situation spontaneous, recognition of spontaneous speech is an area which has only recently emerged in the field of automatic speech recognition. Broadening the application of speech recognition depends crucially on raising recognition performance f...
Spontaneous Speech Recognition and Summarization
This paper overviews recent progress in the development of corpus-based spontaneous speech recognition technology focusing on various achievements of a Japanese 5-year national project “Spontaneous Speech: Corpus and Processing Technology”. Although speech is in almost any situation spontaneous, recognition of spontaneous speech is an area which has only recently emerged in the field of automat...